Title: Grafana L3 Administrator | Senior Grafana Engineer
Location: Melbourne, FL; Hartford, CT; Dallas/Frisco, TX
Contract
Overview: We are seeking a highly skilled Grafana L3 Administrator to join our technical operations team. This senior-level position is responsible for the advanced administration, performance optimization, and troubleshooting of enterprise-scale Grafana environments. You will ensure the reliability, scalability, and security of our monitoring dashboards and play a pivotal role in cross-functional DevOps collaboration.
Key Responsibilities:
- Administer, configure, and manage Grafana instances, including installations, upgrades, patching, and plugin management.
- Monitor, tune, and optimize Grafana performance, dashboard responsiveness, query efficiency, and data storage.
- Provide L3 (Level 3) support: investigate, troubleshoot, and resolve complex issues to minimize downtime and support business continuity.
- Implement and maintain Grafana security, including user access controls, authentication methods (LDAP, SSO), and encryption practices.
- Design and implement High Availability (HA) and Disaster Recovery (DR) solutions for critical Grafana services.
- Set up and maintain monitoring and alerting using Grafana to proactively identify and address system issues.
- Develop and update detailed documentation, SOPs, and technical knowledge base articles for Grafana administration and resolution procedures.
- Collaborate closely with DevOps, IT Operations, Security, and other technical teams to ensure seamless integration and alignment with organizational requirements.
- Conduct capacity planning, forecasting, and scalability assessments to accommodate growth and ensure optimal performance.
Requirements:
- Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
- 6+ years of hands-on experience administering complex, large-scale Grafana environments.
- Deep knowledge of Grafana architecture, deployment, configuration, upgrades, and administration best practices.
- Proactive problem-solving skills with the ability to diagnose and resolve advanced Grafana issues.
- Demonstrated skill in performance tuning, query optimization, and dashboard maintenance.
- Strong understanding of security protocols, encryption, and access controls within Grafana.
- Experience in implementing HA/DR strategies for Grafana deployments.
- Effective communication and collaboration skills with both technical and non-technical teams.
Preferred Qualifications:
- Grafana administrator or related certifications.
- Experience integrating and working with monitoring and time-series data tools such as Prometheus, InfluxDB, or Graphite.